Automatic Labeling of Intonation Using Acoustic and Lexical Features
نویسنده
چکیده
This paper proposes a framework of automatic intonation labeling which involves detection and classification of pitch accents and phrase boundaries. Four statistical models are designed to perform these tasks on the basis of a compact and simple representation consisting of features identified as the main acoustic correlates of accentual prominence and phrase boundaries or describing the acoustic-phonetic realization of different types of pitch accents and boundary tones. The features can be easily derived from utterance's acoustics (F0 and timing cues) and lexical features. The models yield high average accuracy between 80% and 87% depending on the task, and high consistency they approach the levels of agreement among human labelers in manual transcription of intonation.
منابع مشابه
Automatic Prosodic Labeling with Conditional Random Fields and Rich Acoustic Features
Many acoustic approaches to prosodic labeling in English have employed only local classifiers, although text-based classification has employed some sequential models. In this paper we employ linear chain and factorial conditional random fields (CRFs) in conjunction with rich, contextually-based prosodic features, to exploit sequential dependencies and to facilitate integration with lexical feat...
متن کاملPredicting User Satisfaction from Turn-Taking in Spoken Conversations
User satisfaction is an important aspect of the user experience while interacting with objects, systems or people. Traditionally user satisfaction is evaluated a-posteriori via spoken or written questionnaires or interviews. In automatic behavioral analysis we aim at measuring the user emotional states and its descriptions as they unfold during the interaction. In our approach, user satisfactio...
متن کاملAutomatic Prosodic Labeling with Conditional Random Fields and Rich Acoustic Features
Many acoustic approaches to prosodic labeling in English have employed only local classifiers, although text-based classification has employed some sequential models. In this paper we employ linear chain and factorial conditional random fields (CRFs) in conjunction with rich, contextually-based prosodic features, to exploit sequential dependencies and to facilitate integration with lexical feat...
متن کاملPerceptually-Related F0 Parameters for Automatic Classification of Phrase Final Tones
Automatic labeling of prosodic features is an important topic when constructing large speech databases for speech synthesis or analysis purposes. Perceptually-related F0 parameters are proposed with the aim of automatically classifying phrase final tones. Analyses are conducted to verify how consistently subjects are able to categorize phrase final tones, and how perceptual features are related...
متن کاملTone and intonation*
The objectives of this study are (i) to identify the basic components of pitch that can be isolated from tone and attributed to intonation; (ii) to establish them as the elements that must be accounted for in the transcription of an oral corpus. These components are meant to be available for typological studies of the relationship between these elements as they are employed for marking of lexic...
متن کامل